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BACKGROUND OF THE INVENTION 



Field of the Invention 

5 [0001] This invention relates to multithreading processors and, more particularly, to 
managing the execution of multiple threads in a multithreading processor. 

Description of the Related Art 

10 [0002] Multithreading typically refers to executing multiple threads from one or more 
processes at a particular time. One type of multithreading processor is capable of 
executing multiple threads without using multiple cores. Multithreading may provide 
capabilities similar to a multiprocessor system (two or more logical processors) while 
using just one physical processor. A process refers to an application that is being 

1 5 executed by a processor. The process may consist of a private virtual address space, 
executable code, data and other operating system resources. Also, a process may have 
one or more execution threads. A thread, which is sometimes referred to as a lightweight 
process or execution context, is an execution path or flow of control through a given 
process. A thread's current state, which is represented by a current program counter, a 

20 call/return stack, the processor's registers, and some thread-specific data, allows the 
processor to freeze a thread (i.e., stop executing it) and to continue execution at a later 
time from the point where the thread was stopped. 

[0003] Sharing a single processor between multiple threads may minimize the time 
25 required to switch between threads because a minimum amount of information may need 
to be saved and restored when switching from one thread to another. A minimum amount 
of information may need to be saved and restored because the program execution 
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environment is shared between the different threads. This is in contrast to a standard 
context switch in a normal execution process. 

[0004] Programmers write multithreaded application software to enhance the 
5 performance of an application. Since multithreaded programs may be written to execute 
in parallel instead of serially, multithreading may save execution time by allowing parts 
of a process to be independently executed. 

[0005] Although current multithreading technologies may speed up the execution of 
10 processes by executing multiple threads at one time, a significant amount of execution 
time may be wasted on particular threads having instructions that are not ready to be 
executed. One method to overcome this problem is to switch from one thread to another 
or suspend execution of a current thread when the system detects that the current thread is 
taking too long to execute. For example, if execution of an instruction results in a cache 
15 miss or unavailability of resources during execution of the current thread, then the 
processor may switch to another thread that is ready to be executed. However, this 
method may require executing the particular instruction that caused the cache miss, which 
may inherently result in wasted execution time. 

20 



25 
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SUMMARY OF THE INVENTION 



[0006] Various embodiments of a processor and method for supporting compiler 
directed multithreading management are disclosed. In one embodiment, a processor 
5 includes an execution unit configured to execute one or more threads and a detection unit 
coupled to detect whether a given thread includes an identifier. In addition, the execution 
unit may be configured to selectively continue execution of the given thread depending 
upon whether the detection unit detects the identifier. 

10 [0007] In another embodiment, a method of generating low-level instructions 

executable by a processor includes providing a computer program including high-level 
programming instructions and detecting whether an indicator is included within the 
computer program. The method further includes, in response to detecting the indicator, 
generating a low-level instruction having an identifier corresponding to the indicator. 

15 The low-level identifier may be configured to cause the processor to selectively continue 
execution of a given thread. 



20 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0008] FIG. 1 is a block diagram of one embodiment of a multithreading processor. 

5 [0009] FIG. 2 is a diagram illustrating two processes having multiple execution 
threads. 

[0010] FIG. 3 is a flow diagram describing the operation of one embodiment of the 
processor of FIG. 1. 

10 

[0011] FIG. 4 is a flow diagram describing the operation of one embodiment of the 
processor of FIG. 1. 

[0012] FIG. 5 is a block diagram of another embodiment of a multithreading 
15 processor. 

[0013] FIG. 6 illustrates a flow diagram describing the operation of one embodiment 
of the processor of FIG. 5. 

20 [0014] FIG. 7 is a diagram illustrating a compiler for converting high-level 
programming instructions to low-level instructions. 

[0015] FIG. 8 is a flow diagram describing the operation of one embodiment of the 
compiler of FIG. 7. 

25 

[0016] FIG. 9 is a block diagram of one embodiment of a computer system including a 
multithreading processor. 
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[0017] While the invention is susceptible to various modifications and alternative 
forms, specific embodiments thereof are shown by way of example in the drawings and 
will herein be described in detail. It should be understood, however, that the drawings 
and detailed description thereto are not intended to limit the invention to the particular 
5 form disclosed, but on the contrary, the intention is to cover all modifications, equivalents 
and alternatives falling within the spirit and scope of the present invention as defined by 
the appended claims. 



10 
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DETAILED DESCRIPTION 



[0018] Turning now to FIG. 1, a block diagram of one embodiment of a 
multithreading processor 100 is shown. Processor 100 includes a system interface unit 
5 110 coupled to a prefetch unit 1 15, an instruction cache 120, and to a data cache 130. 
Processor 100 also includes a decode unit 140 coupled to the instruction cache 120 and to 
a scheduling unit 150. In addition, a detection unit 160 is coupled to the scheduling unit 
150 and to the decode unit 140. Further, execution unit(s) 170 is coupled to the 
scheduling unit 150 and the data cache 130. 

10 

[0019] It is noted that processor 100 may be representative of a processor in the 
SPARC™ family of processors. However, it is contemplated that in other embodiments, 
processor 100 may be representative of other types of processors such as a processor in 
the x86 family of processors, for example. 

15 

[0020] In the illustrated embodiment, system interface unit 1 10 is coupled to provide 
processor 100 with access to system resources. For example, prefetch unit 115 may 
prefetch instructions from memory through system interface unit 110 and temporarily 
store the instructions in instruction cache 120. In one embodiment, the instructions may 
20 be decoded by decoding unit 140 and sent to the scheduling unit 150. Scheduling unit 
150 is configured to schedule the decoded instructions for execution in the execution 
pipeline. In one embodiment, scheduling unit 150 may store the decoded instructions in 
an execution queue in a particular execution order. 

25 [0021] In the illustrated embodiment, processor 100 is a multithreading processor. 
More particularly, execution unit(s) 1 70 may be configured to execute one or more 
execution threads in a pipelined manner. For example, execution unit(s) 170 may access 
the queue to retrieve the next instruction to execute. In addition, data cache 130 is 
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configured to provide any necessary data to execution unit(s) 170 during execution of a 
given thread. In one embodiment, processor 100 may be a superscalar processor. As 
such, execution unit (s) 170 may be an execution core including multiple integer 
execution units, multiple floating-point units, and a load/store unit, which are not shown 
5 for simplicity. In another embodiment, execution unit(s) 170 may include additional 
execution cores. However, it is contemplated that in other embodiments the execution 
unit(s) 170 may include a single integer execution unit and a single floating point unit. 

[0022] In the illustrated embodiment, detection unit 160 is configured to detect 
whether a current thread (i.e., the thread for which instructions are currently being 
executed or the thread for which instructions are being fetched) includes an identifier. In 
one embodiment, the identifier may be indicative that the current thread may be 
suspended and a different thread may be executed. In addition, execution unit 170 is 
configured to selectively continue execution of the current thread depending upon 
whether the detection unit detects the identifier. In one embodiment, detection unit 160 
may detect whether a decoded instruction includes the identifier. Although in other 
embodiments, detection unit 160 may also be coupled to data cache 130 (as denoted by 
the dashed lines) to detect whether a particular piece of data that may be used to execute 
the current thread may include the identifier. It is noted that although detection unit 160 
is shown as a separate component of processor 100, it is contemplated that in alternative 
embodiments, the functionality of detection unit 160 may be included within other 
functional blocks such as decoding unit 140 or scheduling unit 150 for example. 

[0023] In one embodiment, processor 100 may include counter logic (not shown) that 
25 tracks how long a current thread is taking to execute from the point at which the decision 
is made to continue execution of the current thread. In such an embodiment, counter 
logic may be configured to count a number of execution cycles. If processor 100 is still 
waiting on the current thread after a predetermined number of execution cycles, processor 



10 
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100 may switch to a different thread. It is noted however that in other embodiments, 
processor 100 may track how long a current thread is taking to execute using various 
other methods, such as counting processor clock cycles, for example. 



5 [0024] As will be described in greater detail below, the identifier may be included 
within a thread as an instruction, the identifier may be part of an instruction or the 
identifier may be included as part of a piece of data consumed during execution of a 
thread. In addition, the identifier may be information that is associated with the 
instruction. 

10 

[0025] Referring to FIG. 2, a diagram illustrating two processes having multiple 
execution threads is shown. In the illustrated embodiment, Process A and Process B 
represent two independent processes that may be executed by a processor, such as 
processor 100 of FIG. 1. Process A is divided into three execution threads and these are 
15 designated Tl, T2, and T3, and Process B is divided into five execution threads and these 
are designated T4, T5, T6, T7, and T8. However, it is noted that a particular process may 
be divided into any number of threads as desired. Each of the execution threads includes 
one or more instructions and, as noted above, each thread may be executed in parallel to 
take advantage of the multithreading capabilities of the processor. 

20 

[0026] Furthermore, as described above, a given thread may include an identifier. In 
the illustrated embodiment, the shaded areas correspond to instructions associate with an 
identifier and the unshaded areas correspond to instructions without an identifier. For 
example, Instruction 1 (II) is an instruction having an identifier and Instruction 4 (14) is 
25 an instruction that does not have an identifier. In addition, an execution thread may 

include one or more instructions having an identifier. For example, Thread 2 (T2) is an 
execution thread that has two different instructions (12, 13) each having an identifier. On 
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the other hand, a thread may not have any identifiers. For example, none of the 
instructions in Thread 3 (T3) have an identifier. 

[0027] In one embodiment, the identifier (sometimes referred to as a hint) associated 
5 with a given thread is configured to inform processor 100 that the execution of the given 
thread may be suspended until it can be more efficiently executed or that a different 
thread may be executed before the given thread. For example, the given thread may 
include an identifier to communicate to processor 100 that it will be subject to a MUTEX 
if it executes a particular instruction within the given thread. A MUTEX may refer to a 

10 mutual exclusion object that allows multiple threads to synchronize access to a shared 
resource (sometimes called mutual exclusion of threads). For example, a MUTEX may 
be used to protect a section of code from being executed by more than one thread. Since 
a MUTEX may block other threads from being executed, the identifier may inform 
processor 100 to suspend execution of this particular thread, which is subject to the 

1 5 MUTEX, and to execute a different thread. 

[0028] In another embodiment, an identifier may be indicative that the current thread 
may be suspended and a different thread may be executed depending on one or more 
execution parameters. For example, as will be discussed in greater detail below, 
20 processor 100 may suspend a current thread and execute a different thread if an identifier 
is detected and a priority scheme indicates to switch to the other thread. 

[0029] In an alternative embodiment, the identifier may be indicative that processor 
100 should continue to execute the current thread and to override a global execution 
25 parameter. As used herein, a global execution parameter may be used to inform hardware 
how to respond to a particular event or operation. For example, a global execution 
parameter may, in the absence of an identifier, cause processor 100 to unconditionally 
suspend execution of the current thread due to a cache miss caused by execution of a 
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particular instruction and to switch to a different thread. However, the presence of the 
identifier may indicate to processor 100 that there will only be a short wait time (i.e., a 
local cache miss) and to continue execution of the current thread, thereby overriding the 
global execution parameter. In this instance, execution time corresponding to the latency 
5 associated with a thread switch may be saved. For example, depending on the specific 
implementation, a thread switch may take two clock cycles; therefore, it may be more 
efficient not to switch threads. 

[0030] In one embodiment, the identifier may be a flag included within a particular 
10 instruction of the given thread. For example, one or more unused op-code bits or other 
bits of the particular instruction may be used. In another embodiment, the identifier may 
be a unique instruction within the given thread. In such an embodiment, the sole function 
of the unique instruction may be to inform the processor that the current thread may be 
suspended if another thread is waiting to execute. In yet another embodiment, the 
1 5 identifier may be information that is associated with a given instruction but may follow a 
different path than the given instruction. For example, the identifier may be included 
within data or metadata that is associated with a given instruction. The data including the 
identifier associated with the instruction may be similar to information stored along with 
a cache line in a cache memory, for example. 

20 

[0031] Depending on the implementation, the identifier may include one or more bits 
that may be encoded to have different meanings. For example, one encoding may be 
associated with causing the processor to suspend execution of the current thread and to 
switch to a different thread, while a different encoding of the same identifier may cause 
25 the processor to continue execution. 

[0032] FIG. 3 is a flow diagram describing the operation of one embodiment of the 
multithreading processor of FIG. 1. Referring collectively to FIG. 1 and FIG. 3, processor 
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100 is a multithreading processor configured to execute one or more threads at a time. In 
one embodiment, the software scheduler of the operating system allocates memory space 
for a particular thread that is ready to be executed. The address of the particular thread is 
placed in a run queue to wait to be executed. Prefetch unit 1 15 fetches the instructions 
5 associated with a given thread, which is now the current thread, and once the instructions 
are decoded, execution unit(s) 170 may begin to execute the instructions associated with 
the current thread (block 310). During execution of the current thread, detection unit 160 
is configured to detect whether the current thread includes an identifier (block 320). 
Specifically, in one embodiment, the detection unit 160 is configured to detect whether 
10 one or more decoded instructions of the current thread include an identifier or are 
associated with an identifier. It is noted however that detection unit 160 may be 
configured to detect whether the current thread includes an identifier before commencing 
execution of the current thread. For example, an identifier may be detected in a thread 
prior to the first instruction entering the execution pipeline. 

15 

[0033] If the current thread does not include an identifier, processor 100 continues 
execution of the current thread (block 340). Referring back to block 320, if detection unit 
160 detects an identifier, processor 100 suspends execution of the current thread and 
starts executing a different thread (block 330). Once processor 100 begins execution of 
20 the different thread, detection unit 160 is configured to detect whether the different thread 
includes an identifier (block 320). As described above, detection unit 160 may detect 
whether the different thread includes an identifier before commencing execution of the 
different thread. 

25 [0034] Turning now to FIG. 4, a flow diagram describing the operation of another 
embodiment of the processor of FIG. 1 is shown. Beginning with block 410, processor 
100 is configured to execute one or more threads. Processor 100 determines whether a 
current thread is associated with a global execution parameter (block 420). As noted 
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above, the global execution parameter may be indicative that processor 100 should 
unconditionally switch to another thread if one is awaiting execution. If the current 
thread is not associated with a global execution parameter, then the processor 100 
continues execution of the current thread (block 460). Referring back to block 420, if the 
5 current thread includes a global execution parameter, detection unit 160 is configured to 
detect whether the given thread also includes an identifier (block 430). More particularly, 
detection unit 160 is configured to detect whether one or more instructions of the current 
thread include an identifier or are associated with an identifier. 

1 0 [0035] If the current thread does not include an identifier, execution of the current 
thread is suspended and a different thread is executed (block 440). Referring back to 
block 430, if the current thread includes an identifier, the global execution parameter is 
overridden (block 450) and execution of the current thread is continued (block 460). To 
illustrate using an example, processor 100 continues execution of the current thread even 

1 5 if, for example, there is a cache miss because the identifier may be indicative that the 
cache miss was local and will therefore incur only a short latency. In an alternative 
embodiment, processor 100 may first determine whether the current thread includes an 
identifier and then determine whether the current thread is associated with a global 
execution parameter. 

20 

[0036] Turning now to FIG. 5, a block diagram of another embodiment of a 
multithreading processor is shown. Components that correspond to those shown in FIG. 
1 are numbered identically for clarity and simplicity. In addition to the functional blocks 
shown in processor 100 of FIG. 1, processor 500 of FIG. 5 includes a priority designation 
25 unit 580 which is coupled between decode unit 140 and scheduling unit 150. 

Accordingly, the description of some of the various other blocks that corresponds to the 
description of FIG. 1 above has been omitted for simplicity. As noted above, the 
management of multiple threads in processor 500 may include execution parameters that 
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may control when particular threads are executed and when particular threads are 
suspended. 

[0037] In one embodiment, priority designation unit 580 is configured to determine 
5 the priority level of each thread. In such an embodiment, the priority level of a given 
thread may be assigned during compile time. As such, priority designation unit 580 may 
store the priority of each thread in a storage device such as a register, for example. When 
the instructions of a particular thread are decoded, the priority level may be determined 
and stored. However, it is noted that other methods of associating a particular priority 
10 level to each thread may be used. 

[0038] In another embodiment, priority designation unit 580 is configured to re-assign 
a particular priority level to the current thread based upon conditions associated with the 
execution environment. For example, in one embodiment, priority designation unit 580 
1 5 may assign to the current thread a high priority level or a low priority level relative to 

other threads dependent upon the aging of the current thread. Thus, a priority assigned by 
software may be changed by hardware during execution of the current thread. In such an 
embodiment, priority designation unit 580 may change the priority of the current thread in 
response to detection unit 160 detecting an identifier. 

20 

[0039] Accordingly, regardless of how the priority of a given thread is assigned and/or 
determined, in one embodiment, processor 500 may be configured to selectively continue 
execution of a current thread dependent upon whether detection unit 160 detects an 
identifier and the priority of each thread. For example, if detection unit 160 detects an 
25 identifier in the current thread and if priority designation unit 580 determines (after re- 
assigning priorities) that the priority level of a different thread is higher than the priority 
level of the current thread, processor 500 may switch to the different thread. However, if 
priority designation unit 580 determines that the priority level of the current thread is the 
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same as or higher than the priority level of a different thread, processor 500 may be 
configured to continue execution of the current thread. It is noted that although priority 
designation unit 580 is shown as a separate functional block, the functionality of priority 
designation unit 580 may be included within other functional blocks. 

5 

[0040] In one embodiment, to measure the aging of the current thread, processor 500 
may include counter logic (not shown) that tracks how long a current thread is taking to 
execute. In such an embodiment, counter logic may be configured to count a number of 
execution cycles. If processor 500 is still waiting on the current thread after a 
10 predetermined number of execution cycles, processor 500 may assign the current thread a 
lower priority. It is noted however that in other embodiments, processor 500 may track 
how long a current thread is taking to execute using various other methods, such as 
counting processor clock cycles, for example. 

15 [0041] As illustrated in FIG. 5, detection unit 160 is coupled to priority designation 
unit 580 and detects whether the current thread includes an identifier that is used to 
determine whether to conditionally continue execution of the current thread. In addition, 
detection unit 160 may be coupled to communicate to priority designation unit 580 
whether the current thread includes the identifier. In such an embodiment, priority 

20 designation unit 580 may use the identifier information to assign a new priority level to 
the current thread. However, as shown in FIG. 5, detection unit 160 is coupled to decode 
unit 140, to priority designation unit 580, and to scheduling unit 150 to detect identifiers 
at different points within the execution of the thread; thereby enabling priority 
designation unit 580 to assign new priority levels at different points during the process. 

25 In addition, it is noted that detection unit 160 may be coupled to other components within 
processor 500. 
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[0042] FIG. 6 illustrates a flow diagram describing the operation of one embodiment 
of the processor of FIG. 5. Referring collectively to FIG. 5 and FIG. 6, processor 500 
begins execution of a thread of one or more threads (block 610). Detection unit 160 
detects whether the current thread includes an identifier (block 620). If the current thread 
5 does not include an identifier, processor 500 continues execution of the current thread 
(block 630). However, referring back to block 620, if detection unit 160 detects that the 
current thread includes an identifier, priority designation unit 580 assigns a new priority 
to the current thread dependent upon the execution environment (block 640). 

1 0 [0043] Once the priority of the current thread is changed, priority designation unit 580 
determines whether the current thread has a higher or lower priority than the other 
pending thread(s) (block 650). If the current thread does not have a lower priority than 
the other pending thread(s), processor 500 continues execution of the current thread 
(block 620). In one embodiment, processor 500 continues execution of the current thread 

15 until it finishes executing the current thread or until another identifier is detected (block 
620). Referring back to block 650, if the current thread does have a lower priority than 
the other pending thread(s), processor 500 suspends execution of the current thread and 
executes a different thread (process block 660). 

20 [0044] In one embodiment, if processor 500 determines that a particular thread does 
not have a priority level, then the processor 500 may assign a default priority level to the 
particular thread. In such an embodiment, the default priority level may be the lowest 
priority. 

25 [0045] Turning now to FIG. 7, a diagram of one embodiment of a compiler for 

converting high-level programming instructions to low-level instructions is shown. High- 
level programming instructions 710 may be written in any high-level programming 
language, such as C, or C++, for example. The low-level instructions 720, which are the 
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output of compiler 700, are representative of a machine language, such as a binary code 
that may be readily operated on by a processor. 

[0046] In one embodiment, programmers may write high-level computer programs 
5 including high-level instructions 710 and may include an indicator in one or more of the 
high-level instructions 710. For example, a programmer may have prior knowledge that a 
particular instruction will take a long time to execute or can determine that statistically it 
should take a long time to execute, thereby causing the thread which includes the 
particular instruction to stall. To reduce the effect of such a stall, the programmer may 
10 include an indicator in the thread that is indicative that the thread may be suspended and a 
different pending thread may be executed. 

[0047] In the illustrated embodiment, compiler 700 detects the indicator and creates an 
identifier within the corresponding low-level instructions 720. It is noted that the 

1 5 identifier created by the compiler 700 may have the same properties as the identifiers 
described above in conjunction with the descriptions of FIG. 1 through FIG. 6. For 
example, the identifier that may be associated with the current execution thread may be a 
hint to inform the processor that the execution of the current thread may be suspended 
until it can be more efficiently executed or that a different thread may be executed before 

20 the current thread. Alternatively, the identifier that may be associated with a current 
execution thread may be a hint to inform the processor that the execution of the current 
thread should continue, irrespective of any global execution parameters to the contrary. 
Therefore, compiler 700 can detect the indicators within the high-level code and create 
identifiers, which, when placed into an execution thread, may help manage and guide the 

25 selection of threads in the processor as described above. 

[0048] In one embodiment, the indicator may be a compiler directive or 'pragma'. In 
another embodiment, the indicator may be an assembly language subroutine that may be 
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embedded within the high-level code. In yet another embodiment, the indicator may be a 
unique high-level instruction having the sole function of communicating to the compiler 
that an identifier is required for a particular instruction. In an alternative embodiment, 
compiler 700 may be configured recognize certain instructions as indicators. For 
5 example, compiler 700 may recognize an instruction as one that results in a spin lock. 

[0049] FIG. 8 depicts a flow diagram describing the operation of one embodiment of 
the compiler of FIG. 7. Referring collectively to FIG. 7 and FIG. 8, a computer program 
including high-level programming instructions is provided to compiler 700 (block 810). 
10 Compiler 700 detects whether an indicator is included within the high-level computer 
program (block 820). 

[0050] If complier 700 detects an indicator, compiler 700 generates a low-level 
instruction having an identifier corresponding to the indicator (block 830). As described 
15 above, the low-level identifier is configured to cause the processor to selectively continue 
execution of a given thread. However, referring back to block 820, if compiler 700 does 
not detect an indicator, compiler 700 generates low-level instructions without identifiers 
(block 840). 

20 [0051] Turning now to FIG.9, a block diagram of one embodiment of a computer 

system including a multithreading processor is shown. Computer system 900 includes a 
processor 910 coupled to a main memory 920 via a memory bus 925. Computer system 
900 also includes an I/O bridge 930 that is coupled to processor 910 via a system bus 935, 
In addition, computer system 900 includes a peripheral device 940 that is coupled to I/O 

25 bridge 930 via a peripheral bus 945. It is noted that in one embodiment, processor 910 
may be illustrative of processor 100 of FIG. 1. In another embodiment, processor 910 
may be illustrative of processor 500 of FIG. 5. 
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[0052] In the illustrated embodiment, main memory 920 is a memory medium in 
which application programs are stored and from which processor 910 primarily executes. 
A suitable main memory 920 may include memory devices in the Dynamic Random 
Access Memory (DRAM) family of devices. For example, a plurality of banks of 
5 Synchronous DRAM (SDRAM), Double Data Rate (DDR) SDRAM, or Rambus DRAM 
(RDRAM) devices may be suitable. 

[0053] In addition to main memory 920, computer system 900 may include additional 
memory mediums (not shown) on which computer programs according to various 

10 embodiments may be stored. The term "memory medium" may include an installation 
medium, e.g., a CD-ROM, or floppy disks, additional system memory such as DRAM, 
SRAM, EDO DRAM, SDRAM, DDR SDRAM, Rambus RAM, etc., or a non-volatile 
memory such as a read-only memory (ROM), or magnetic media such as a hard drive, or 
optical storage. The memory medium may include other types of memory as well, or 

1 5 combinations thereof. The memory medium preferably stores a compiled version of a 
software program or programs for event-triggered transaction processing as described 
herein. The software program(s) may be implemented in any of various ways, including 
procedure-based techniques, component-based techniques, and/or object-oriented 
techniques, among others. For example, the software program may be implemented using 

20 ActiveX controls, C++ objects, JavaBeans, Microsoft Foundation Classes (MFC), or other 
technologies or methodologies, as desired. A processor, such as processor 910, executing 
code and data from the memory medium includes a means for creating and executing the 
software program or programs according to the methods and/or block diagrams described 
above. 

25 

[0054] In addition, the memory medium may be located within a second computer 
system (not shown), which connects to computer system 900 over a network. In this 
instance, the second computer provides the program instructions to computer system 900 
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for execution. Also, it is contemplated that computer system 900 may take various forms, 
including a personal computer system, mainframe computer system, workstation, network 
appliance, Internet appliance, personal digital assistant (PDA), television system or other 
device. In general, the term "computer system" can be broadly defined to encompass any 
5 device having a processor that executes instructions from a memory medium. 

[0055] In one embodiment, I/O bridge 930 provides an interface between processor 
910, main memory 920, and peripheral device 940. When an operation is received from 
one of the devices connected to I/O bridge 930, I/O bridge 930 identifies the target of the 
10 operation and routes the operation to the targeted device. I/O bridge 930 generally 

translates an operation from the protocol used by the source device or bus to the protocol 
used by the target device or bus. 

[0056] In the illustrated embodiment, peripheral device 940 may be illustrative of a 
1 5 variety of peripheral devices such as, for example, network interface cards, video 

accelerators, audio cards, hard or floppy disk drives or drive controllers, SCSI (Small 
Computer Systems Interface) adapters and telephony cards. Furthermore, peripheral bus 
945 may be illustrative of a variety of peripheral buses such as a peripheral component 
interconnect (PCI) bus, for example. 

20 

[0057] It is noted that, while the PCI bus has been used as an example in the above 
description, any bus architectures may be substituted as desired. It is further noted that 
computer system 900 may be a multiprocessing computer system including additional 
processors (not shown). The additional processors may be similar to processor 910. 
25 More particularly, the additional processor may be an identical copy of processor 910. 

The additional processor may be connected to I/O bridge 930 via an independent bus (not 
shown) or may share system bus 935 with processor 910. 
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[0058] It is noted that computer system 900 is an exemplary computer system and that 
other computer systems are possible. More particularly, it is contemplated that in other 
embodiments, processor 100 of FIG. 1 and processor 500 of FIG. 5 may be employed 
within other types of computers systems. For example, processors 100 and 500 may be 
5 employed within computer systems having hierarchical memory structures, such as a non- 
uniform memory access (NUMA) architectures. In such systems, each processor may 
have access to a local memory structure in addition to the memory structure of another 
processor. 

10 [0059] Although the embodiments above have been described in considerable detail, 
numerous variations and modifications will become apparent to those skilled in the art 
once the above disclosure is fully appreciated. It is intended that the following claims be 
interpreted to embrace all such variations and modifications. 
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